alexott's Repositories

100 repositories

ace
Ace (Ajax.org Cloud9 Editor)
⭐ 0 🌐 Public
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
⭐ 0 🌐 Public
airflow-site
Apache Airflow Website
⭐ 0 🌐 Public
alexott.github.com
My site, http://alexott.net/
⭐ 4 🌐 Public
alia
High performance Cassandra client for clojure
⭐ 0 🌐 Public
anomaly_detection_using_databricks
No description
⭐ 6 🌐 Public
Awesome-SOAR
A curated Cyber "Security Orchestration, Automation and Response (SOAR)" awesome list.
⭐ 1 🌐 Public
azure-cosmos-db-cassandra-api-spark-connector-sample
Sample that provides guidelines and best practices for using the DataStax Spark Cassandra Connector against the Cosmos DB Cassandra API.
⭐ 1 🌐 Public
azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
⭐ 1 🌐 Public
azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
⭐ 1 🌐 Public
beats
🐠 Beats - Lightweight shippers for Elasticsearch & Logstash
⭐ 1 🌐 Public
beats-zerobus
Databricks Zerobus support in Elastic beats (filebeat, ...)
⭐ 0 🌐 Public
boost-asio-examples
Source code for examples from article "What is Boost.Asio, and why we should use it", http://alexott.net/en/cpp/BoostAsioNotes.html
⭐ 85 🌐 Public 📦 Archived
boost-asio-proxy
Source code for examples from article "How to write simple HTTP proxy with Boost.Asio", http://alexott.net/en/cpp/BoostAsioProxy.html
⭐ 81 🌐 Public
cassaforte
Modern, high-level Clojure driver (client) for Cassandra built around CQL 3
⭐ 0 🌐 Public
cassandra
Mirror of Apache Cassandra
⭐ 0 🌐 Public
cassandra-dse-playground
Code samples for different components of DSE (DataStax Enterprise) & related technologies
⭐ 5 🌐 Public
cedet
My mirror of the CEDET bzr repository (http://cedet.sf.net). Mostly used for experimental stuff that will be merged into the bzr version later
⭐ 15 🌐 Public
chaos
No description
⭐ 0 🌐 Public
chispa
PySpark test helper methods with beautiful error messages
⭐ 0 🌐 Public
clj-gsb
Clojure interface to Google's Safe Browsing API
⭐ 1 🌐 Public
clj-serializer
Fast binary serialization and deserialization for Clojure data structures
⭐ 2 🌐 Public
clj-tika
Clojure bindings to Apache Tika project
⭐ 24 🌐 Public
clojure
The Clojure programming language
⭐ 1 🌐 Public
clojure-course-ru-concurrency
Transcript & slides of lectures on concurrency in Clojure (for https://clojurecourse.by/)
⭐ 3 🌐 Public
clojure-examples
Different examples in Clojure - for articles, blog postings, etc.
⭐ 7 🌐 Public
clojure-hadoop
Library to aid writing Hadoop jobs in Clojure.
⭐ 98 🌐 Public
clojure-hbase-schemas
Schema-based HBase Interaction
⭐ 2 🌐 Public
clojure-libs
Different libraries for clojure
⭐ 6 🌐 Public
clojure-opennlp
Natural Language Processing in Clojure (opennlp)
⭐ 1 🌐 Public
clojure-semantic
Experiments with Emacs semantic.el and Clojure
⭐ 0 🌐 Public
courses
fast.ai Courses
⭐ 0 🌐 Public
cpp-tesing-examples
Examples for article on Unit testing with C++
⭐ 19 🌐 Public
cql-mode
Emacs mode for working with CQL (Cassandra Query Language)
⭐ 2 🌐 Public
cyber-spark-data-connectors
Cybersecurity-related custom data connectors for Spark
⭐ 2 🌐 Public
dabs-playground
Different examples around Databricks Asset Bundles (DABs)
⭐ 3 🌐 Public
dasl-content-packs
No description
⭐ 1 🌐 Public
databricks-api
A simplified, autogenerated API client interface using the databricks-cli package
⭐ 1 🌐 Public
databricks-cicd-definitelynotademo
No description
⭐ 1 🌐 Public
databricks-cybersecurity-playground
Different pieces of code related to doing cybersecurity on Databricks
⭐ 4 🌐 Public
databricks-dbt-playground
Playing with DBT on Databricks
⭐ 4 🌐 Public
databricks-nutter-repos-demo
Demo of using Nutter for testing Databricks notebooks in a CI/CD pipeline
⭐ 152 🌐 Public
databricks-playground
Code samples, etc. for Databricks
⭐ 73 🌐 Public
databricks-repos-proxy
No description
⭐ 0 🌐 Public
databricks-sdk-go
Databricks SDK for Go
⭐ 0 🌐 Public
databricks-sdk-java
Databricks SDK for Java
⭐ 0 🌐 Public
databricks-sdk-py
Databricks SDK for Python
⭐ 1 🌐 Public
databricks-sql-connector-unofficial
Unofficial sources for Databricks SQL connector (until it's officially published)
⭐ 1 🌐 Public 📦 Archived
databricks-sql-python
Databricks SQL Connector for Python
⭐ 1 🌐 Public
datalake-ADLS-access-patterns-with-Databricks
No description
⭐ 1 🌐 Public
datastax-bootcamp-project
Source code for project from DataStax's bootcamp
⭐ 4 🌐 Public 📦 Archived
db-demo-project
demo project
⭐ 0 🌐 Public
dbx
CLI tool for advanced Databricks jobs management.
⭐ 1 🌐 Public
dbx-stable-url
A small Terraform Repository to create Stable URL infrastructure in AWS and Azure.
⭐ 0 🌐 Public
db_dlt_workshop
Databricks Delta Live Tables Workshop
⭐ 0 🌐 Public
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
⭐ 1 🌐 Public
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
⭐ 2 🌐 Public
Delta-Live-Tables-Hands-on-Workshop
Delta Live Tables Workshop Resources
⭐ 1 🌐 Public
delta-live-tables-notebooks
No description
⭐ 0 🌐 Public
delta-live-tables-playground
Examples of Databricks Delta Live Tables
⭐ 0 🌐 Public
delta-sharing
An open protocol for secure data sharing
⭐ 1 🌐 Public
dlt-files-in-repos-demo
Demonstration of using Files in Repos with Databricks Delta Live Tables
⭐ 35 🌐 Public
dnks-terraform-lab
Curated collection of reusable Terraform snippets, samples, blueprints, examples, etc.
⭐ 0 🌐 Public
dns-analytics
No description
⭐ 0 🌐 Public
dotemacs
My personal Emacs configuration
⭐ 3 🌐 Public
dsbulk
DataStax Bulk Loader (DSBulk) is an open-source, Apache-licensed, unified tool for loading into and unloading from Apache Cassandra(R), DataStax Astra and DataStax Enterprise (DSE)
⭐ 0 🌐 Public
dse-java-playground
Playing with different pieces of DSE Java driver
⭐ 2 🌐 Public 📦 Archived
dse-search-tools
Utility classes for working with DSE Search
⭐ 0 🌐 Public 📦 Archived
ecb
!! It was moved to https://github.com/ecb-home/ecb !!!
⭐ 99 🌐 Public
emacs-addons
Repository of my packages (either written by me, or hacked by me)
⭐ 2 🌐 Public
emacs-configs
My personal Emacs configuration
⭐ 255 🌐 Public
emacs-guide-ru
No description
⭐ 14 🌐 Public
empythy
Automated NLP sentiment predictions: batteries included, or use your own data
⭐ 0 🌐 Public
fastbook
Draft of the fastai book
⭐ 0 🌐 Public
galimatias
galimatias is a URL parsing and normalization library written in Java.
⭐ 0 🌐 Public
gatling-dse-examples
Examples of using gatling-dse-plugin & gatling-dse-stress
⭐ 0 🌐 Public
geomesa
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
⭐ 0 🌐 Public
graph
Practical Gremlin - An Apache TinkerPop Tutorial
⭐ 0 🌐 Public
graphframes
No description
⭐ 1 🌐 Public
hbc
A Java HTTP client for consuming Twitter's Streaming API
⭐ 0 🌐 Public
incubator-sedona
A cluster computing framework for processing large-scale geospatial data
⭐ 0 🌐 Public
infer
inference and machine learning in clojure
⭐ 16 🌐 Public
java-driver
DataStax Java Driver for Apache Cassandra
⭐ 0 🌐 Public
JFastText
Java interface for fastText
⭐ 0 🌐 Public
kafka-connect-twitter
Kafka Connect Source for Twitter
⭐ 1 🌐 Public
kafka-connect-twitter-1
Kafka Connect connector to stream data in real time from Twitter.
⭐ 0 🌐 Public
kafka-streams-experiments
Experiments with Kafka Streams
⭐ 0 🌐 Public
kafka-streams-playground
A few examples for Kafka Streams
⭐ 0 🌐 Public
ksql-exps
Experiments with KSQL
⭐ 0 🌐 Public
lein-hadoop
leiningen plugin for generating hadoop-compatible jars
⭐ 2 🌐 Public
lein-simple-project
Example of a project that uses Leiningen
⭐ 3 🌐 Public
merchant-classification
This series of notebooks shows how the Lakehouse for Financial Services enables banks, open banking aggregators and payment processors to address the challenge of merchant classification
⭐ 0 🌐 Public
migrate
No description
⭐ 0 🌐 Public
mlflow
Open source platform for the machine learning lifecycle
⭐ 0 🌐 Public
mlflow-webhook-azure-devops
No description
⭐ 4 🌐 Public
muse
Emacs MUSE
⭐ 56 🌐 Public
neo4j-spark-connector
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
⭐ 1 🌐 Public
NETL-Automatic-Topic-Labelling-
Generating labels for topics automatically using neural embeddings
⭐ 1 🌐 Public
nlp_model_selection_app
No description
⭐ 0 🌐 Public
nlu
1 line for hundreds of NLP models and algorithms
⭐ 1 🌐 Public